LogDoor User's Guide

LogDoor Files

LogDoor Task files

You specify the details of how LogDoor should process a log through LogDoor task files. Each LogDoor task file specifies an input log file to process, and output file and scheduling information. A LogDoor task file also holds a pointer to the last byte in the input log file which has been processed by the task. Finally the task file maintains a complete summary, on a day-by-day, site-by-site basis, of all the information it has processed to date. This information is displayed in the task status window and written, on a periodic basis, to the site-by-site summary reports.

In many cases you will only need one LogDoor task file. If however, you have multiple log files, or want to process the log file in different ways, you may want to set up a number of LogDoor task files. LogDoor can only have one task file open at a time. You can create a new LogDoor task file, or choose one that's been created previously, only if no other task file is currently opened. Use "Close Task" from the File menu to close the currently open LogDoor task (and terminate processing if any is ongoing). Use "New Task" to create a new task file, or "Open Task" to open one which has previously been created.

As indicated above, the LogDoor task file contains all of the state information associated with the task's processing. This information is checkpointed out to disk on a periodic basis. If you wish to be sure this information is saved at a specific time, you can choose "Checkpoint Now" from the Task menu

To remove an old site from the state information associated with the task, and from the task status window, select the site in the task status window and choose "Remove Site" from the Task menu. All state information about the site will be deleted, and no future log files or reports for the site will be written (unless a valid access to the site is encountered during subsequent processing).

LogDoor's Output Files

Much of LogDoor's usefulness comes from the output files which it produces. This section provides details of those files.

LogDoor's output files are based on the "site" indicated in each entry of the input log file. LogDoor defines the site as the highest-level folder indicated in the URL field of the entry. If there is no folder in the URL, the site is defined as "root." Here are some examples:

    URL                          Site

    :Site1:default.html          Site1
    :Site1:folder:file.html      Site1
    :Site2:f1:f2:f3:file.html    Site2
    :default.html                Root
    :file.html                   Root

The following figure presents an overall summary of the type and structure of a LogDoor task's output files.


As shown above, all of a task's output is written into the folder indicated to the task through the Output Files button or menu item. If Output Files is never selected, the task's output is written into a folder named "Output" in the same folder as the task itself.

For each site that the task encounters in the input log, the task looks in its output folder for a folder with that site name. If one is not found, the task creates a new folder for the site. Either way, the task then writes its output files into that folder, creating new files as necessary. If the folder is actually an alias, the task writes its output files into the folder pointed to by that alias, allowing the files to be written, for instance, into folders at different places within your Web hierarhcy (see Appendix 1 for some examples).

A LogDoor task can create up to three files for each site that the task encounters in the input log:

You can use the Output Files dialog to indicate which of these files should be produced by a LogDoor task.

When writing output logs, the LogDoor task echoes any input log format line to each output log file, so that log post-processing applications will be able to correctly process LogDoor output logs. Site logs are always appended to and are never reset by a LogDoor task.

Site logs, like all three of LogDoor's output file types, are named based on the site name. LogDoor appends a user-specified suffix to the site name to form the output file name. You can specify the suffix in the Output Files dialog. The default suffix used for log files is ".log", so the name of a log file will generally have the format <site-name>.log.

In addition to indicating that a file is of a specific type, suffixes are useful for changing the names of LogDoor's output files at a particular time. For instance, you could use a suffix which included the month name, and change that suffix at the end of every month, causing LogDoor to write out new logs each month. If a suffix causes the resulting file name to have more than 32 characters, the site name will be truncated as necessary.

The HTML and text-based summary reports are written by the LogDoor task on a periodic basis, as indicated in the Scheduling dialog box (see the "Scheduling" section). Summary reports can also be written at any time through the "Write Summaries Now" item in the Task menu. These summaries contain day-by-day totals of the task's processing in terms of the number of hits, bytes transfered and errors for pages within each site. Unlike the output log files, summaries are reset by the "Reset Summary Data" command in the Task menu. See Appendix 2 for details of these summaries. The default suffix used for HTML summaries is ".html" and for text summaries is ".txt".

A LogDoor task also periodically writes out a summary report of its processing on an overall basis. This summary is written into the folder called "Overall" (or through an alias by that name) within the LogDoor output folder. The HTML overall summary report is named using the format overall<suffix>, and the text summary using the format overall<suffix>.

LogDoor's HTML summary reports can be customized by placing a text file named "Heading.html" in the same folder as the task file, and the text-based summary reports can be customized in the same way with a file name "Heading.txt". See Appendix 2 for details of these files.

LogDoor Input files

The input file for a LogDoor task can be any valid WebSTAR or MacHTTP log file. WebSTAR log files should begin with the standard format line written by WebSTAR when it starts a new log file or changes its log file format. If a LogDoor input file does not begin with a format line, the LogDoor task will derive the format of that file based on the first entry in the file.

Whenever a new input file is selected for a LogDoor task, the task will automatically configure itself to start processing at the beginning of that input file. If a task has already processed some of the current input file, but you wish to have the task restart processing at the beginning of that file, simply re-select the input file using the Input Log File button or Input File menu item.

A LogDoor task will process its input file until it has reached the end of that file. Once it has reached the file's end, the task will continue scanning that file for any new data which may be added by the Web server, and it will automatically process that data. In this way LogDoor can serve as a real-time monitor of your Web server's activity.

Back to Table of Contents
Back to LogDoor Basics
Forward to Scheduling